Skip to content

Improve reference files: graduated challenge, story freshness, calibration proxy, transcript formats#32

Closed
dbhat93 wants to merge 1 commit intonoamseg:mainfrom
dbhat93:contribute-upstream
Closed

Improve reference files: graduated challenge, story freshness, calibration proxy, transcript formats#32
dbhat93 wants to merge 1 commit intonoamseg:mainfrom
dbhat93:contribute-upstream

Conversation

@dbhat93
Copy link
Copy Markdown

@dbhat93 dbhat93 commented Mar 2, 2026

Summary

Improvements to 6 reference files and the decode + feedback commands, built while deeply using the skill. These were developed collaboratively and pressure-tested before submission.

What's included

references/challenge-protocol.md (new file)

  • Graduated activation levels (L3=Assumption Audit only, L4=+Blind Spot, L5=all 5 lenses) — challenge begins earlier and becomes familiar rather than jarring
  • Challenge Debt: tracking mechanism for unresolved challenges across sessions (records, surfaces at next relevant session, expires after 3 sessions)
  • 6 command-specific invocations with level-specific behavior

references/calibration-engine.md (new file)

  • Practice Calibration Proxy (Section 8): plateau detection without real interviews — triggers when 5+ practices show ≤0.3 score variance over 3 sessions. New status: practice-calibrating
  • Score Confidence Display (Section 9): Low/Medium/High confidence table with when-to-surface guidance

references/story-mapping-engine.md (new file)

  • Framing Variety Constraint with 3-criteria Substantiation Test replacing the "no story twice" rule — a strong story told twice with different angles beats forcing a weaker story into one slot
  • Story Freshness Score (Section 5): 3-factor decay model (use frequency risk, experience age, Proven Performer offset) with 4 verdicts (Fresh / Fresh(Proven) / Moderate / Stale)
  • Freshness column added to output schema mapping matrix

references/transcript-formats.md (new file)

  • Added Fireflies.ai as a 9th supported format (was missing)
  • Zoom Webinar distinction (panelist vs. attendee label conventions)
  • Granola quality signal: cross-check AI summary against raw transcript to detect truncation or AI hallucination
  • Format Fingerprint: 5-signal fallback for ambiguous cases before defaulting to Manual

references/commands/decode.md (new file)

  • Pathway dimension (Referral/Warm intro/LinkedIn/Cold/Internal) added to batch triage with explicit tier-boost logic (+1 tier for Referral/Warm, +0.5 for Internal)
  • JD Honest Limitations surfaced as named section in every Standard/Deep output — not buried

references/commands/feedback.md (new file)

  • Memory reliability tagging for Type D (Same-day=high, 1–2 days=medium, 3+ days=low)
  • Guided elicitation prompts for thin Type D memories (4 specific questions to surface what's actually there)
  • Rejection Leverage defers to graduated challenge levels instead of hardcoding L5

Test plan

  • Review each reference file for internal consistency
  • Verify command-specific invocations in challenge-protocol match the invocation points in stories, analyze, practice, progress, hype, feedback
  • Check that transcript-formats disambiguation rules cover all 9 formats
  • Confirm story-mapping-engine integration points reference correct command files

🤖 Generated with Claude Code

…ation proxy, transcript formats

challenge-protocol.md:
- Graduated activation (Level 3 = Assumption Audit only, Level 4 = + Blind
  Spot, Level 5 = all 5 lenses) instead of binary Level 5 only
- Challenge Debt: unresolved challenges tracked across sessions and surfaced
  at the start of the next relevant session instead of silently dropped
- Avoidance Confrontation now has distinct Level 3/4 behaviors, not just
  a single Level 5 mode

story-mapping-engine.md:
- Framing Variety Constraint replaces "no story twice" rule: a story may
  appear twice in a mapping if the angles are substantively different,
  tested via a 3-criteria Substantiation Test (different competency,
  different STAR section, different interviewer takeaway)
- Story Freshness Score: 3-factor decay function (use frequency risk,
  experience age risk, Proven Performer offset) replacing raw use-count
  overuse check. Verdict: Fresh / Fresh (Proven) / Moderate / Stale

calibration-engine.md:
- Practice Calibration Proxy (Section 8): plateau detection without real
  interview outcomes — when same answer type practiced 5+ times and scores
  vary ≤0.3 across 3 consecutive sessions, flag as practice ceiling and
  prescribe behavioral change. New status: practice-calibrating.
- Score Confidence Display (Section 9): scores carry calibration status
  context (Low/Medium/High confidence) so candidates don't treat estimates
  as ground truth
- De-escalation Check: root causes are explicitly resolved when affected
  dimensions improve, preventing over-diagnosis drift
- Explicit transition rule: practice-calibrating → calibrating when 3+
  real outcomes arrive

transcript-formats.md:
- Added Fireflies.ai as 9th supported format with disambiguation rules
  separating it from Grain
- Zoom Webinar distinction: Panelist: prefix handling
- Format Fingerprint for ambiguous cases: 5-signal fingerprint + medium-
  confidence detection decision, gives candidate basis to override
- Granola quality signal: AI summary cross-checked against raw transcript,
  flags possible truncation or hallucination

decode.md:
- Pathway-weighted batch triage: Referral / Warm intro / LinkedIn /
  Cold / Internal dimension added to ranking. A Long-Shot Stretch with
  a Referral ranks above an Investable Stretch cold application.
- JD Honest Limitations surfaced in every Standard/Deep output as a named
  section (not just reference knowledge). Quick Scans include a one-line
  reminder. Candidates never walk away treating a decode as a complete picture.

feedback.md:
- Memory reliability tagging for Type D (Post-Session Memory): Same-day /
  1-2 days / 3+ days tags applied to Question Bank entries
- Guided elicitation prompts for thin memories: surfaces what's actually
  there rather than accepting vague recall as complete capture
- Rejection Leverage defers to graduated Challenge Protocol levels instead
  of hardcoding Level 5 behavior

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@dbhat93
Copy link
Copy Markdown
Author

dbhat93 commented Apr 1, 2026

Closing this — the changes here have been superseded by PR #35, which consolidates all v3.1-v3.3 work into a single PR. Thanks!

@dbhat93 dbhat93 closed this Apr 1, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants